Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 220.031 |
| Missing cells | 364.423 |
| Missing cells (%) | 9.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 30.2 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Text | 3 |
| Categorical | 3 |
| DateTime | 1 |
| Unsupported | 2 |
city is highly overall correlated with latitude and 2 other fields | High correlation |
host_id is highly overall correlated with id | High correlation |
id is highly overall correlated with host_id | High correlation |
latitude is highly overall correlated with city and 1 other fields | High correlation |
longitude is highly overall correlated with city and 1 other fields | High correlation |
neighbourhood_group is highly overall correlated with city and 3 other fields | High correlation |
number_of_reviews is highly overall correlated with reviews_per_month | High correlation |
price is highly overall correlated with neighbourhood_group and 1 other fields | High correlation |
price(€) is highly overall correlated with price | High correlation |
reviews_per_month is highly overall correlated with number_of_reviews | High correlation |
last_review has 54371 (24.7%) missing values | Missing |
reviews_per_month has 54371 (24.7%) missing values | Missing |
neighbourhood_group has 151518 (68.9%) missing values | Missing |
city has 103390 (47.0%) missing values | Missing |
price is highly skewed (γ1 = 85.87746722) | Skewed |
minimum_nights is highly skewed (γ1 = 26.59625713) | Skewed |
price(€) is highly skewed (γ1 = 21.19928567) | Skewed |
id has unique values | Unique |
calculated_host_listings_count is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
availability_365 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
number_of_reviews has 54248 (24.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-16 07:56:36.732979 |
|---|---|
| Analysis finished | 2024-10-16 07:57:01.546078 |
| Duration | 24.81 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
id
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 220031 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22408313 |
| Minimum | 2539 |
|---|---|
| Maximum | 50955051 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 2539 |
|---|---|
| 5-th percentile | 2627945.5 |
| Q1 | 13383695 |
| median | 22497889 |
| Q3 | 31554452 |
| 95-th percentile | 39695005 |
| Maximum | 50955051 |
| Range | 50952512 |
| Interquartile range (IQR) | 18170757 |
Descriptive statistics
| Standard deviation | 11754902 |
|---|---|
| Coefficient of variation (CV) | 0.52457773 |
| Kurtosis | -0.81637171 |
| Mean | 22408313 |
| Median Absolute Deviation (MAD) | 9090535 |
| Skewness | -0.045580406 |
| Sum | 4.9305235 × 1012 |
| Variance | 1.3817772 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36487245 | 1 | < 0.1% |
| 6400 | 1 | < 0.1% |
| 23986 | 1 | < 0.1% |
| 28300 | 1 | < 0.1% |
| 32119 | 1 | < 0.1% |
| 32649 | 1 | < 0.1% |
| 37256 | 1 | < 0.1% |
| 40470 | 1 | < 0.1% |
| 42732 | 1 | < 0.1% |
| 46536 | 1 | < 0.1% |
| Other values (220021) | 220021 |
| Value | Count | Frequency (%) |
| 2539 | 1 | |
| 2595 | 1 | |
| 3647 | 1 | |
| 3831 | 1 | |
| 5022 | 1 | |
| 5099 | 1 | |
| 5121 | 1 | |
| 5178 | 1 | |
| 5203 | 1 | |
| 5238 | 1 |
| Value | Count | Frequency (%) |
| 50955051 | 1 | |
| 50950278 | 1 | |
| 50934102 | 1 | |
| 50932398 | 1 | |
| 50932336 | 1 | |
| 50931203 | 1 | |
| 50929172 | 1 | |
| 50928814 | 1 | |
| 50928474 | 1 | |
| 50927557 | 1 |
name
Text
| Distinct | 213293 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 67 |
| Missing (%) | < 0.1% |
| Memory size | 1.7 MiB |
Length
| Max length | 259 |
|---|---|
| Median length | 165 |
| Mean length | 37.531196 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8.255.512 |
|---|---|
| Distinct characters | 2.303 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 209.517 ? |
|---|---|
| Unique (%) | 95.3% |
Sample
| 1st row | The Studio Milan |
|---|---|
| 2nd row | " Characteristic Milanese flat" |
| 3rd row | nice flat near the park |
| 4th row | Nico & Cynthia's Easy Yellow Suite |
| 5th row | Nico&Cinzia's Red Easy Suite! |
| Value | Count | Frequency (%) |
| in | 64425 | 4.8% |
| room | 40633 | 3.0% |
| 37234 | 2.8% | |
| apartment | 31574 | 2.4% |
| bedroom | 25575 | 1.9% |
| flat | 20533 | 1.5% |
| to | 19286 | 1.4% |
| with | 17410 | 1.3% |
| 2 | 17172 | 1.3% |
| private | 16697 | 1.2% |
| Other values (53153) | 1052387 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1130782 | 13.7% | |
| e | 605345 | 7.3% |
| o | 584481 | 7.1% |
| a | 504894 | 6.1% |
| t | 469537 | 5.7% |
| n | 465533 | 5.6% |
| i | 428172 | 5.2% |
| r | 420095 | 5.1% |
| l | 260230 | 3.2% |
| s | 215491 | 2.6% |
| Other values (2293) | 3170952 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8255512 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1130782 | 13.7% | |
| e | 605345 | 7.3% |
| o | 584481 | 7.1% |
| a | 504894 | 6.1% |
| t | 469537 | 5.7% |
| n | 465533 | 5.6% |
| i | 428172 | 5.2% |
| r | 420095 | 5.1% |
| l | 260230 | 3.2% |
| s | 215491 | 2.6% |
| Other values (2293) | 3170952 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8255512 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1130782 | 13.7% | |
| e | 605345 | 7.3% |
| o | 584481 | 7.1% |
| a | 504894 | 6.1% |
| t | 469537 | 5.7% |
| n | 465533 | 5.6% |
| i | 428172 | 5.2% |
| r | 420095 | 5.1% |
| l | 260230 | 3.2% |
| s | 215491 | 2.6% |
| Other values (2293) | 3170952 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8255512 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1130782 | 13.7% | |
| e | 605345 | 7.3% |
| o | 584481 | 7.1% |
| a | 504894 | 6.1% |
| t | 469537 | 5.7% |
| n | 465533 | 5.6% |
| i | 428172 | 5.2% |
| r | 420095 | 5.1% |
| l | 260230 | 3.2% |
| s | 215491 | 2.6% |
| Other values (2293) | 3170952 |
host_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 144510 |
|---|---|
| Distinct (%) | 65.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84945278 |
| Minimum | 1944 |
|---|---|
| Maximum | 4.1172076 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1944 |
|---|---|
| 5-th percentile | 1788451.5 |
| Q1 | 14396024 |
| median | 46403919 |
| Q3 | 1.4150968 × 108 |
| 95-th percentile | 2.6320938 × 108 |
| Maximum | 4.1172076 × 108 |
| Range | 4.1171882 × 108 |
| Interquartile range (IQR) | 1.2711365 × 108 |
Descriptive statistics
| Standard deviation | 88566074 |
|---|---|
| Coefficient of variation (CV) | 1.042625 |
| Kurtosis | 0.25493533 |
| Mean | 84945278 |
| Median Absolute Deviation (MAD) | 40903928 |
| Skewness | 1.1026115 |
| Sum | 1.8690594 × 1013 |
| Variance | 7.8439494 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33889201 | 999 | 0.5% |
| 219517861 | 327 | 0.1% |
| 27693585 | 314 | 0.1% |
| 137094377 | 236 | 0.1% |
| 28820321 | 233 | 0.1% |
| 156158778 | 232 | 0.1% |
| 107434423 | 232 | 0.1% |
| 48165024 | 213 | 0.1% |
| 36410227 | 197 | 0.1% |
| 1432477 | 183 | 0.1% |
| Other values (144500) | 216865 |
| Value | Count | Frequency (%) |
| 1944 | 1 | < 0.1% |
| 2438 | 1 | < 0.1% |
| 2571 | 1 | < 0.1% |
| 2697 | 1 | < 0.1% |
| 2787 | 6 | |
| 2845 | 2 | < 0.1% |
| 2868 | 1 | < 0.1% |
| 2881 | 2 | < 0.1% |
| 3151 | 1 | < 0.1% |
| 3211 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 411720762 | 1 | |
| 411489946 | 1 | |
| 411016508 | 1 | |
| 410486696 | 1 | |
| 410265727 | 1 | |
| 410205054 | 1 | |
| 410041372 | 1 | |
| 409925612 | 1 | |
| 409413422 | 1 | |
| 409051230 | 1 |
host_name
Text
| Distinct | 31499 |
|---|---|
| Distinct (%) | 14.4% |
| Missing | 706 |
| Missing (%) | 0.3% |
| Memory size | 1.7 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 33 |
| Mean length | 6.5376724 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1.433.875 |
|---|---|
| Distinct characters | 898 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17.662 ? |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | Francesca |
|---|---|
| 2nd row | Jeremy |
| 3rd row | Marta |
| 4th row | Nico&Cinzia |
| 5th row | Nico&Cinzia |
| Value | Count | Frequency (%) |
| 5552 | 2.2% | |
| and | 2571 | 1.0% |
| david | 1559 | 0.6% |
| maria | 1472 | 0.6% |
| veeve | 1235 | 0.5% |
| anna | 1202 | 0.5% |
| laura | 1183 | 0.5% |
| michael | 1182 | 0.5% |
| alex | 1152 | 0.5% |
| sarah | 1087 | 0.4% |
| Other values (25792) | 237634 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 182992 | 12.8% |
| e | 130654 | 9.1% |
| i | 117780 | 8.2% |
| n | 105165 | 7.3% |
| r | 82510 | 5.8% |
| o | 73642 | 5.1% |
| l | 72231 | 5.0% |
| t | 47514 | 3.3% |
| s | 47237 | 3.3% |
| 36755 | 2.6% | |
| Other values (888) | 537395 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1433875 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 182992 | 12.8% |
| e | 130654 | 9.1% |
| i | 117780 | 8.2% |
| n | 105165 | 7.3% |
| r | 82510 | 5.8% |
| o | 73642 | 5.1% |
| l | 72231 | 5.0% |
| t | 47514 | 3.3% |
| s | 47237 | 3.3% |
| 36755 | 2.6% | |
| Other values (888) | 537395 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1433875 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 182992 | 12.8% |
| e | 130654 | 9.1% |
| i | 117780 | 8.2% |
| n | 105165 | 7.3% |
| r | 82510 | 5.8% |
| o | 73642 | 5.1% |
| l | 72231 | 5.0% |
| t | 47514 | 3.3% |
| s | 47237 | 3.3% |
| 36755 | 2.6% | |
| Other values (888) | 537395 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1433875 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 182992 | 12.8% |
| e | 130654 | 9.1% |
| i | 117780 | 8.2% |
| n | 105165 | 7.3% |
| r | 82510 | 5.8% |
| o | 73642 | 5.1% |
| l | 72231 | 5.0% |
| t | 47514 | 3.3% |
| s | 47237 | 3.3% |
| 36755 | 2.6% | |
| Other values (888) | 537395 |
neighbourhood
Text
| Distinct | 562 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 24 |
| Mean length | 10.287673 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2.263.607 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TIBALDI |
|---|---|
| 2nd row | NAVIGLI |
| 3rd row | SARPI |
| 4th row | VIALE MONZA |
| 5th row | VIALE MONZA |
| Value | Count | Frequency (%) |
| ku | 11185 | 3.5% |
| sydney | 10611 | 3.3% |
| and | 10600 | 3.3% |
| westminster | 9588 | 3.0% |
| tower | 8246 | 2.6% |
| hamlets | 8246 | 2.6% |
| chelsea | 7131 | 2.2% |
| east | 6592 | 2.1% |
| hackney | 6276 | 2.0% |
| kensington | 6193 | 1.9% |
| Other values (648) | 234273 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 196710 | 8.7% |
| a | 179738 | 7.9% |
| n | 144620 | 6.4% |
| t | 124698 | 5.5% |
| s | 117550 | 5.2% |
| i | 116979 | 5.2% |
| r | 113911 | 5.0% |
| 98910 | 4.4% | |
| o | 93639 | 4.1% |
| l | 92648 | 4.1% |
| Other values (56) | 984204 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2263607 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 196710 | 8.7% |
| a | 179738 | 7.9% |
| n | 144620 | 6.4% |
| t | 124698 | 5.5% |
| s | 117550 | 5.2% |
| i | 116979 | 5.2% |
| r | 113911 | 5.0% |
| 98910 | 4.4% | |
| o | 93639 | 4.1% |
| l | 92648 | 4.1% |
| Other values (56) | 984204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2263607 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 196710 | 8.7% |
| a | 179738 | 7.9% |
| n | 144620 | 6.4% |
| t | 124698 | 5.5% |
| s | 117550 | 5.2% |
| i | 116979 | 5.2% |
| r | 113911 | 5.0% |
| 98910 | 4.4% | |
| o | 93639 | 4.1% |
| l | 92648 | 4.1% |
| Other values (56) | 984204 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2263607 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 196710 | 8.7% |
| a | 179738 | 7.9% |
| n | 144620 | 6.4% |
| t | 124698 | 5.5% |
| s | 117550 | 5.2% |
| i | 116979 | 5.2% |
| r | 113911 | 5.0% |
| 98910 | 4.4% | |
| o | 93639 | 4.1% |
| l | 92648 | 4.1% |
| Other values (56) | 984204 |
latitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 98672 |
|---|---|
| Distinct (%) | 44.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.573053 |
| Minimum | -34.135212 |
|---|---|
| Maximum | 51.68169 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 36662 |
| Negative (%) | 16.7% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | -34.135212 |
|---|---|
| 5-th percentile | -33.893956 |
| Q1 | 40.41262 |
| median | 40.79424 |
| Q3 | 51.4962 |
| 95-th percentile | 51.55527 |
| Maximum | 51.68169 |
| Range | 85.816902 |
| Interquartile range (IQR) | 11.08358 |
Descriptive statistics
| Standard deviation | 30.144854 |
|---|---|
| Coefficient of variation (CV) | 0.92545375 |
| Kurtosis | 0.9956733 |
| Mean | 32.573053 |
| Median Absolute Deviation (MAD) | 10.66852 |
| Skewness | -1.6752658 |
| Sum | 7167081.3 |
| Variance | 908.71221 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.42165 | 34 | < 0.1% |
| 45.4884 | 26 | < 0.1% |
| 51.49455 | 24 | < 0.1% |
| 51.51459 | 24 | < 0.1% |
| 51.51571 | 23 | < 0.1% |
| 51.5138 | 23 | < 0.1% |
| 51.51307 | 22 | < 0.1% |
| 51.51375 | 22 | < 0.1% |
| 51.51343 | 22 | < 0.1% |
| 51.52486 | 22 | < 0.1% |
| Other values (98662) | 219789 |
| Value | Count | Frequency (%) |
| -34.1352122 | 1 | |
| -34.12623664 | 1 | |
| -34.09899758 | 1 | |
| -34.09853882 | 1 | |
| -34.09440025 | 1 | |
| -34.09254562 | 1 | |
| -34.08949817 | 1 | |
| -34.08949651 | 1 | |
| -34.08937477 | 1 | |
| -34.08848048 | 1 |
| Value | Count | Frequency (%) |
| 51.68169 | 1 | |
| 51.6792 | 1 | |
| 51.67651 | 1 | |
| 51.67566 | 1 | |
| 51.67501 | 1 | |
| 51.67393 | 1 | |
| 51.67344 | 1 | |
| 51.67297 | 1 | |
| 51.67219 | 1 | |
| 51.67215 | 1 |
longitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 108328 |
|---|---|
| Distinct (%) | 49.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.428135 |
| Minimum | -74.24442 |
|---|---|
| Maximum | 151.33981 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 147948 |
| Negative (%) | 67.2% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | -74.24442 |
|---|---|
| 5-th percentile | -73.98507 |
| Q1 | -3.70587 |
| median | -0.12838 |
| Q3 | 9.199535 |
| 95-th percentile | 151.25387 |
| Maximum | 151.33981 |
| Range | 225.58423 |
| Interquartile range (IQR) | 12.905405 |
Descriptive statistics
| Standard deviation | 76.030471 |
|---|---|
| Coefficient of variation (CV) | 4.6280646 |
| Kurtosis | -0.54913921 |
| Mean | 16.428135 |
| Median Absolute Deviation (MAD) | 9.30053 |
| Skewness | 0.77143012 |
| Sum | 3614699 |
| Variance | 5780.6325 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -3.707 | 37 | < 0.1% |
| 9.21423 | 26 | < 0.1% |
| -3.70217 | 20 | < 0.1% |
| -3.70409 | 19 | < 0.1% |
| -3.70712 | 19 | < 0.1% |
| -3.70443 | 19 | < 0.1% |
| 9.19091 | 18 | < 0.1% |
| -73.95427 | 18 | < 0.1% |
| -3.70502 | 18 | < 0.1% |
| -73.95677 | 18 | < 0.1% |
| Other values (108318) | 219819 |
| Value | Count | Frequency (%) |
| -74.24442 | 1 | |
| -74.24285 | 1 | |
| -74.24084 | 1 | |
| -74.23986 | 1 | |
| -74.23914 | 1 | |
| -74.23803 | 1 | |
| -74.23059 | 1 | |
| -74.21238 | 1 | |
| -74.21017 | 1 | |
| -74.20941 | 1 |
| Value | Count | Frequency (%) |
| 151.3398112 | 1 | |
| 151.339805 | 1 | |
| 151.3397888 | 1 | |
| 151.3397674 | 1 | |
| 151.3396904 | 1 | |
| 151.3396779 | 1 | |
| 151.3395821 | 1 | |
| 151.3395529 | 1 | |
| 151.3395483 | 1 | |
| 151.3392876 | 1 |
room_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| Entire home/apt | |
|---|---|
| Private room | |
| Shared room | 4012 |
| Hotel room | 1353 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.716776 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3.018.116 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private room |
|---|---|
| 2nd row | Entire home/apt |
| 3rd row | Private room |
| 4th row | Entire home/apt |
| 5th row | Entire home/apt |
Common Values
| Value | Count | Frequency (%) |
| Entire home/apt | 128154 | |
| Private room | 86512 | |
| Shared room | 4012 | 1.8% |
| Hotel room | 1353 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| entire | 128154 | |
| home/apt | 128154 | |
| room | 91877 | |
| private | 86512 | |
| shared | 4012 | 0.9% |
| hotel | 1353 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 348185 | |
| t | 344173 | |
| o | 313261 | |
| r | 310555 | |
| 220031 | 7.3% | |
| m | 220031 | 7.3% |
| a | 218678 | 7.2% |
| i | 214666 | 7.1% |
| h | 132166 | 4.4% |
| n | 128154 | 4.2% |
| Other values (9) | 568216 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3018116 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 348185 | |
| t | 344173 | |
| o | 313261 | |
| r | 310555 | |
| 220031 | 7.3% | |
| m | 220031 | 7.3% |
| a | 218678 | 7.2% |
| i | 214666 | 7.1% |
| h | 132166 | 4.4% |
| n | 128154 | 4.2% |
| Other values (9) | 568216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3018116 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 348185 | |
| t | 344173 | |
| o | 313261 | |
| r | 310555 | |
| 220031 | 7.3% | |
| m | 220031 | 7.3% |
| a | 218678 | 7.2% |
| i | 214666 | 7.1% |
| h | 132166 | 4.4% |
| n | 128154 | 4.2% |
| Other values (9) | 568216 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3018116 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 348185 | |
| t | 344173 | |
| o | 313261 | |
| r | 310555 | |
| 220031 | 7.3% | |
| m | 220031 | 7.3% |
| a | 218678 | 7.2% |
| i | 214666 | 7.1% |
| h | 132166 | 4.4% |
| n | 128154 | 4.2% |
| Other values (9) | 568216 |
price
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1566 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 917.8157 |
| Minimum | 0 |
|---|---|
| Maximum | 1000046 |
| Zeros | 50 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 55 |
| median | 99 |
| Q3 | 177 |
| 95-th percentile | 3013 |
| Maximum | 1000046 |
| Range | 1000046 |
| Interquartile range (IQR) | 122 |
Descriptive statistics
| Standard deviation | 8285.2166 |
|---|---|
| Coefficient of variation (CV) | 9.0271026 |
| Kurtosis | 9849.2478 |
| Mean | 917.8157 |
| Median Absolute Deviation (MAD) | 51 |
| Skewness | 85.877467 |
| Sum | 2.0194791 × 108 |
| Variance | 68644813 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 7761 | 3.5% |
| 50 | 7007 | 3.2% |
| 150 | 6719 | 3.1% |
| 60 | 5996 | 2.7% |
| 80 | 5439 | 2.5% |
| 120 | 5120 | 2.3% |
| 40 | 5006 | 2.3% |
| 75 | 4392 | 2.0% |
| 90 | 4385 | 2.0% |
| 35 | 4093 | 1.9% |
| Other values (1556) | 164113 |
| Value | Count | Frequency (%) |
| 0 | 50 | < 0.1% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 24 | < 0.1% |
| 9 | 35 | < 0.1% |
| 10 | 137 | |
| 11 | 53 | < 0.1% |
| 12 | 98 |
| Value | Count | Frequency (%) |
| 1000046 | 9 | |
| 899977 | 1 | < 0.1% |
| 749980 | 1 | < 0.1% |
| 689509 | 1 | < 0.1% |
| 392206 | 1 | < 0.1% |
| 299992 | 1 | < 0.1% |
| 249958 | 1 | < 0.1% |
| 211652 | 1 | < 0.1% |
| 200031 | 3 | < 0.1% |
| 149996 | 1 | < 0.1% |
minimum_nights
Real number (ℝ)
SKEWED 
| Distinct | 164 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2580227 |
| Minimum | 1 |
|---|---|
| Maximum | 1250 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 21 |
| Maximum | 1250 |
| Range | 1249 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 20.118261 |
|---|---|
| Coefficient of variation (CV) | 3.8262027 |
| Kurtosis | 1129.4337 |
| Mean | 5.2580227 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 26.596257 |
| Sum | 1156928 |
| Variance | 404.74443 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 72425 | |
| 2 | 60093 | |
| 3 | 32232 | |
| 5 | 11977 | 5.4% |
| 4 | 11538 | 5.2% |
| 7 | 10007 | 4.5% |
| 30 | 6004 | 2.7% |
| 6 | 2962 | 1.3% |
| 14 | 2433 | 1.1% |
| 10 | 1878 | 0.9% |
| Other values (154) | 8482 | 3.9% |
| Value | Count | Frequency (%) |
| 1 | 72425 | |
| 2 | 60093 | |
| 3 | 32232 | |
| 4 | 11538 | 5.2% |
| 5 | 11977 | 5.4% |
| 6 | 2962 | 1.3% |
| 7 | 10007 | 4.5% |
| 8 | 402 | 0.2% |
| 9 | 212 | 0.1% |
| 10 | 1878 | 0.9% |
| Value | Count | Frequency (%) |
| 1250 | 1 | < 0.1% |
| 1125 | 6 | |
| 1124 | 4 | |
| 1118 | 1 | < 0.1% |
| 1000 | 8 | |
| 999 | 7 | |
| 900 | 1 | < 0.1% |
| 800 | 2 | < 0.1% |
| 750 | 1 | < 0.1% |
| 720 | 1 | < 0.1% |
number_of_reviews
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 548 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.129164 |
| Minimum | 0 |
|---|---|
| Maximum | 896 |
| Zeros | 54248 |
| Zeros (%) | 24.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 4 |
| Q3 | 19 |
| 95-th percentile | 98 |
| Maximum | 896 |
| Range | 896 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 43.012277 |
|---|---|
| Coefficient of variation (CV) | 2.1368139 |
| Kurtosis | 32.419453 |
| Mean | 20.129164 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 4.6628449 |
| Sum | 4429040 |
| Variance | 1850.056 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 54248 | |
| 1 | 24326 | 11.1% |
| 2 | 15146 | 6.9% |
| 3 | 11127 | 5.1% |
| 4 | 8707 | 4.0% |
| 5 | 6964 | 3.2% |
| 6 | 5958 | 2.7% |
| 7 | 5171 | 2.4% |
| 8 | 4594 | 2.1% |
| 9 | 4063 | 1.8% |
| Other values (538) | 79727 |
| Value | Count | Frequency (%) |
| 0 | 54248 | |
| 1 | 24326 | |
| 2 | 15146 | 6.9% |
| 3 | 11127 | 5.1% |
| 4 | 8707 | 4.0% |
| 5 | 6964 | 3.2% |
| 6 | 5958 | 2.7% |
| 7 | 5171 | 2.4% |
| 8 | 4594 | 2.1% |
| 9 | 4063 | 1.8% |
| Value | Count | Frequency (%) |
| 896 | 1 | |
| 825 | 1 | |
| 756 | 1 | |
| 716 | 1 | |
| 706 | 1 | |
| 682 | 1 | |
| 658 | 1 | |
| 654 | 1 | |
| 652 | 1 | |
| 648 | 1 |
last_review
Date
MISSING 
| Distinct | 2804 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 54371 |
| Missing (%) | 24.7% |
| Memory size | 1.7 MiB |
| Minimum | 2010-04-19 00:00:00 |
|---|---|
| Maximum | 2021-12-06 00:00:00 |
reviews_per_month
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1125 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 54371 |
| Missing (%) | 24.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2567716 |
| Minimum | 0.01 |
|---|---|
| Maximum | 58.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.04 |
| Q1 | 0.2 |
| median | 0.69 |
| Q3 | 1.8 |
| 95-th percentile | 4.3 |
| Maximum | 58.5 |
| Range | 58.49 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.5244489 |
|---|---|
| Coefficient of variation (CV) | 1.2129881 |
| Kurtosis | 33.752985 |
| Mean | 1.2567716 |
| Median Absolute Deviation (MAD) | 0.58 |
| Skewness | 2.9701195 |
| Sum | 208196.78 |
| Variance | 2.3239445 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3884 | 1.8% |
| 0.04 | 3476 | 1.6% |
| 0.03 | 3216 | 1.5% |
| 0.02 | 2878 | 1.3% |
| 0.09 | 2808 | 1.3% |
| 0.05 | 2806 | 1.3% |
| 0.06 | 2800 | 1.3% |
| 0.08 | 2496 | 1.1% |
| 0.07 | 2477 | 1.1% |
| 0.1 | 2166 | 1.0% |
| Other values (1115) | 136653 | |
| (Missing) | 54371 | 24.7% |
| Value | Count | Frequency (%) |
| 0.01 | 277 | 0.1% |
| 0.02 | 2878 | |
| 0.03 | 3216 | |
| 0.04 | 3476 | |
| 0.05 | 2806 | |
| 0.06 | 2800 | |
| 0.07 | 2477 | |
| 0.08 | 2496 | |
| 0.09 | 2808 | |
| 0.1 | 2166 |
| Value | Count | Frequency (%) |
| 58.5 | 1 | |
| 51.21 | 1 | |
| 45.15 | 1 | |
| 39.38 | 1 | |
| 27.95 | 1 | |
| 23.73 | 1 | |
| 23.69 | 1 | |
| 23.28 | 1 | |
| 20.94 | 1 | |
| 20.13 | 1 |
calculated_host_listings_count
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
availability_365
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
price(€)
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3180 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135.51577 |
| Minimum | 0 |
|---|---|
| Maximum | 14843.87 |
| Zeros | 50 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 28.17 |
| Q1 | 52.83 |
| median | 88.98 |
| Q3 | 149.71 |
| 95-th percentile | 354.71 |
| Maximum | 14843.87 |
| Range | 14843.87 |
| Interquartile range (IQR) | 96.88 |
Descriptive statistics
| Standard deviation | 274.37403 |
|---|---|
| Coefficient of variation (CV) | 2.0246649 |
| Kurtosis | 643.09845 |
| Mean | 135.51577 |
| Median Absolute Deviation (MAD) | 43.05 |
| Skewness | 21.199286 |
| Sum | 29817671 |
| Variance | 75281.107 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60.12 | 3116 | 1.4% |
| 120.24 | 3066 | 1.4% |
| 48.1 | 2750 | 1.2% |
| 42.08 | 2450 | 1.1% |
| 144.29 | 2393 | 1.1% |
| 180.36 | 2374 | 1.1% |
| 54.11 | 2325 | 1.1% |
| 36.07 | 2307 | 1.0% |
| 72.15 | 2277 | 1.0% |
| 96.19 | 2254 | 1.0% |
| Other values (3170) | 194719 |
| Value | Count | Frequency (%) |
| 0 | 50 | |
| 1.2 | 1 | < 0.1% |
| 3.67 | 2 | < 0.1% |
| 4.81 | 1 | < 0.1% |
| 7.21 | 1 | < 0.1% |
| 7.34 | 2 | < 0.1% |
| 7.35 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 8.07 | 46 | |
| 8.42 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 14843.87 | 1 | < 0.1% |
| 12024.2 | 1 | < 0.1% |
| 12023 | 3 | < 0.1% |
| 11999 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 9999 | 17 | |
| 9856 | 2 | < 0.1% |
| 9785 | 1 | < 0.1% |
| 9619.36 | 1 | < 0.1% |
| 9441.4 | 1 | < 0.1% |
neighbourhood_group
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 151518 |
| Missing (%) | 68.9% |
| Memory size | 1.7 MiB |
| Manhattan | |
|---|---|
| Brooklyn | |
| Centro | |
| Queens | |
| Salamanca | 1324 |
| Other values (21) |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 8.2834207 |
| Min length | 5 |
Characters and Unicode
| Total characters | 567.522 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ChamartÃn |
|---|---|
| 2nd row | Latina |
| 3rd row | Arganzuela |
| 4th row | Centro |
| 5th row | Arganzuela |
Common Values
| Value | Count | Frequency (%) |
| Manhattan | 21661 | 9.8% |
| Brooklyn | 20104 | 9.1% |
| Centro | 8649 | 3.9% |
| Queens | 5666 | 2.6% |
| Salamanca | 1324 | 0.6% |
| Chamberà | 1252 | 0.6% |
| Arganzuela | 1104 | 0.5% |
| Bronx | 1091 | 0.5% |
| Tetuán | 816 | 0.4% |
| Carabanchel | 708 | 0.3% |
| Other values (16) | 6138 | 2.8% |
| (Missing) | 151518 |
Length
| Value | Count | Frequency (%) |
| manhattan | 21661 | |
| brooklyn | 20104 | |
| centro | 8649 | 11.6% |
| queens | 5666 | 7.6% |
| 1366 | 1.8% | |
| salamanca | 1324 | 1.8% |
| chamberà | 1252 | 1.7% |
| arganzuela | 1104 | 1.5% |
| bronx | 1091 | 1.5% |
| tetuán | 816 | 1.1% |
| Other values (26) | 11476 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 89087 | |
| n | 87847 | |
| t | 56484 | |
| o | 52589 | |
| r | 36834 | 6.5% |
| e | 30021 | 5.3% |
| l | 29471 | 5.2% |
| h | 24201 | 4.3% |
| M | 22333 | 3.9% |
| B | 21864 | 3.9% |
| Other values (32) | 116791 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 567522 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 89087 | |
| n | 87847 | |
| t | 56484 | |
| o | 52589 | |
| r | 36834 | 6.5% |
| e | 30021 | 5.3% |
| l | 29471 | 5.2% |
| h | 24201 | 4.3% |
| M | 22333 | 3.9% |
| B | 21864 | 3.9% |
| Other values (32) | 116791 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 567522 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 89087 | |
| n | 87847 | |
| t | 56484 | |
| o | 52589 | |
| r | 36834 | 6.5% |
| e | 30021 | 5.3% |
| l | 29471 | 5.2% |
| h | 24201 | 4.3% |
| M | 22333 | 3.9% |
| B | 21864 | 3.9% |
| Other values (32) | 116791 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 567522 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 89087 | |
| n | 87847 | |
| t | 56484 | |
| o | 52589 | |
| r | 36834 | 6.5% |
| e | 30021 | 5.3% |
| l | 29471 | 5.2% |
| h | 24201 | 4.3% |
| M | 22333 | 3.9% |
| B | 21864 | 3.9% |
| Other values (32) | 116791 |
city
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 103390 |
| Missing (%) | 47.0% |
| Memory size | 1.7 MiB |
| New York | |
|---|---|
| Sidney | |
| Madrid | |
| Tokyo |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.7400828 |
| Min length | 5 |
Characters and Unicode
| Total characters | 786.170 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Tokyo |
|---|---|
| 2nd row | Tokyo |
| 3rd row | Tokyo |
| 4th row | Tokyo |
| 5th row | Tokyo |
Common Values
| Value | Count | Frequency (%) |
| New York | 48895 | |
| Sidney | 36662 | 16.7% |
| Madrid | 19618 | 8.9% |
| Tokyo | 11466 | 5.2% |
| (Missing) | 103390 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| new | 48895 | |
| york | 48895 | |
| sidney | 36662 | |
| madrid | 19618 | |
| tokyo | 11466 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 85557 | |
| d | 75898 | |
| o | 71827 | |
| r | 68513 | 8.7% |
| k | 60361 | 7.7% |
| i | 56280 | 7.2% |
| 48895 | 6.2% | |
| w | 48895 | 6.2% |
| Y | 48895 | 6.2% |
| N | 48895 | 6.2% |
| Other values (6) | 172154 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 786170 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 85557 | |
| d | 75898 | |
| o | 71827 | |
| r | 68513 | 8.7% |
| k | 60361 | 7.7% |
| i | 56280 | 7.2% |
| 48895 | 6.2% | |
| w | 48895 | 6.2% |
| Y | 48895 | 6.2% |
| N | 48895 | 6.2% |
| Other values (6) | 172154 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 786170 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 85557 | |
| d | 75898 | |
| o | 71827 | |
| r | 68513 | 8.7% |
| k | 60361 | 7.7% |
| i | 56280 | 7.2% |
| 48895 | 6.2% | |
| w | 48895 | 6.2% |
| Y | 48895 | 6.2% |
| N | 48895 | 6.2% |
| Other values (6) | 172154 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 786170 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 85557 | |
| d | 75898 | |
| o | 71827 | |
| r | 68513 | 8.7% |
| k | 60361 | 7.7% |
| i | 56280 | 7.2% |
| 48895 | 6.2% | |
| w | 48895 | 6.2% |
| Y | 48895 | 6.2% |
| N | 48895 | 6.2% |
| Other values (6) | 172154 |
| city | host_id | id | latitude | longitude | minimum_nights | neighbourhood_group | number_of_reviews | price | price(€) | reviews_per_month | room_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| city | 1.000 | 0.275 | 0.375 | 0.708 | 1.000 | 0.017 | 1.000 | 0.076 | 0.026 | 0.026 | 0.022 | 0.110 |
| host_id | 0.275 | 1.000 | 0.529 | -0.019 | 0.040 | -0.179 | 0.142 | -0.141 | -0.003 | -0.068 | 0.212 | 0.058 |
| id | 0.375 | 0.529 | 1.000 | 0.082 | -0.004 | -0.127 | 0.175 | -0.318 | -0.005 | 0.003 | 0.283 | 0.049 |
| latitude | 0.708 | -0.019 | 0.082 | 1.000 | -0.229 | -0.050 | 1.000 | 0.029 | -0.242 | 0.114 | -0.016 | 0.064 |
| longitude | 1.000 | 0.040 | -0.004 | -0.229 | 1.000 | -0.061 | 1.000 | -0.083 | 0.119 | -0.103 | -0.018 | 0.092 |
| minimum_nights | 0.017 | -0.179 | -0.127 | -0.050 | -0.061 | 1.000 | 0.025 | -0.133 | 0.089 | 0.130 | -0.259 | 0.005 |
| neighbourhood_group | 1.000 | 0.142 | 0.175 | 1.000 | 1.000 | 0.025 | 1.000 | 0.062 | 1.000 | 0.041 | 0.046 | 0.145 |
| number_of_reviews | 0.076 | -0.141 | -0.318 | 0.029 | -0.083 | -0.133 | 0.062 | 1.000 | -0.054 | -0.077 | 0.696 | 0.014 |
| price | 0.026 | -0.003 | -0.005 | -0.242 | 0.119 | 0.089 | 1.000 | -0.054 | 1.000 | 0.829 | 0.058 | 0.000 |
| price(€) | 0.026 | -0.068 | 0.003 | 0.114 | -0.103 | 0.130 | 0.041 | -0.077 | 0.829 | 1.000 | -0.022 | 0.028 |
| reviews_per_month | 0.022 | 0.212 | 0.283 | -0.016 | -0.018 | -0.259 | 0.046 | 0.696 | 0.058 | -0.022 | 1.000 | 0.024 |
| room_type | 0.110 | 0.058 | 0.049 | 0.064 | 0.092 | 0.005 | 0.145 | 0.014 | 0.000 | 0.028 | 0.024 | 1.000 |
| id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | price(€) | neighbourhood_group | city | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6400 | The Studio Milan | 13822 | Francesca | TIBALDI | 45.44119 | 9.17813 | Private room | 100 | 4 | 12 | 2010-04-19 | 0.14 | 1 | 358 | 100.0 | NaN | NaN |
| 1 | 23986 | " Characteristic Milanese flat" | 95941 | Jeremy | NAVIGLI | 45.44806 | 9.17373 | Entire home/apt | 150 | 1 | 15 | 2020-07-09 | 0.21 | 1 | 363 | 150.0 | NaN | NaN |
| 2 | 28300 | nice flat near the park | 121663 | Marta | SARPI | 45.47647 | 9.17359 | Private room | 180 | 1 | 8 | 2012-04-22 | 0.11 | 1 | 365 | 180.0 | NaN | NaN |
| 3 | 32119 | Nico & Cynthia's Easy Yellow Suite | 138683 | Nico&Cinzia | VIALE MONZA | 45.52014 | 9.22300 | Entire home/apt | 75 | 2 | 15 | 2018-01-07 | 0.23 | 3 | 200 | 75.0 | NaN | NaN |
| 4 | 32649 | Nico&Cinzia's Red Easy Suite! | 138683 | Nico&Cinzia | VIALE MONZA | 45.51874 | 9.22495 | Entire home/apt | 71 | 2 | 29 | 2016-10-23 | 0.71 | 3 | 308 | 71.0 | NaN | NaN |
| 5 | 37256 | COZY FULLY FURNISHED PRIVATE STUDIO CITY CENTER | 119002 | Giancarlo | BUENOS AIRES - VENEZIA | 45.46884 | 9.20777 | Private room | 55 | 2 | 34 | 2019-05-13 | 0.49 | 2 | 0 | 55.0 | NaN | NaN |
| 6 | 40470 | Giacinto Cosy & clean flat near MM1 | 174203 | Giacinto | VIALE MONZA | 45.52023 | 9.22747 | Entire home/apt | 75 | 3 | 37 | 2017-07-24 | 0.33 | 2 | 350 | 75.0 | NaN | NaN |
| 7 | 42732 | Navigli near down town, linked Expo | 186608 | Francesco | MAGENTA - S. VITTORE | 45.45814 | 9.17654 | Entire home/apt | 199 | 2 | 14 | 2018-04-22 | 0.20 | 2 | 362 | 199.0 | NaN | NaN |
| 8 | 46536 | Nico & Cinzia's Pink Suite! | 138683 | Nico&Cinzia | VIALE MONZA | 45.52276 | 9.22478 | Entire home/apt | 76 | 2 | 27 | 2018-03-07 | 0.23 | 3 | 150 | 76.0 | NaN | NaN |
| 9 | 55055 | BEAUTIFUL MODERN ATTIC CENTER OF MI | 246217 | Cristina | BUENOS AIRES - VENEZIA | 45.48096 | 9.21686 | Entire home/apt | 145 | 3 | 2 | 2016-04-16 | 0.03 | 1 | 365 | 145.0 | NaN | NaN |
| id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | price(€) | neighbourhood_group | city | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 220021 | 36482809 | Stunning Bedroom NYC! Walking to Central Park!! | 131529729 | Kendall | East Harlem | 40.79633 | -73.93605 | Private room | 75 | 2 | 0 | NaT | NaN | 2 | 353 | 68.55 | Manhattan | New York |
| 220022 | 36483010 | Comfy 1 Bedroom in Midtown East | 274311461 | Scott | Midtown | 40.75561 | -73.96723 | Entire home/apt | 200 | 6 | 0 | NaT | NaN | 1 | 176 | 182.80 | Manhattan | New York |
| 220023 | 36483152 | Garden Jewel Apartment in Williamsburg New York | 208514239 | Melki | Williamsburg | 40.71232 | -73.94220 | Entire home/apt | 170 | 1 | 0 | NaT | NaN | 3 | 365 | 155.38 | Brooklyn | New York |
| 220024 | 36484087 | Spacious Room w/ Private Rooftop, Central location | 274321313 | Kat | Hell's Kitchen | 40.76392 | -73.99183 | Private room | 125 | 4 | 0 | NaT | NaN | 1 | 31 | 114.25 | Manhattan | New York |
| 220025 | 36484363 | QUIT PRIVATE HOUSE | 107716952 | Michael | Jamaica | 40.69137 | -73.80844 | Private room | 65 | 1 | 0 | NaT | NaN | 2 | 163 | 59.41 | Queens | New York |
| 220026 | 36484665 | Charming one bedroom - newly renovated rowhouse | 8232441 | Sabrina | Bedford-Stuyvesant | 40.67853 | -73.94995 | Private room | 70 | 2 | 0 | NaT | NaN | 2 | 9 | 63.98 | Brooklyn | New York |
| 220027 | 36485057 | Affordable room in Bushwick/East Williamsburg | 6570630 | Marisol | Bushwick | 40.70184 | -73.93317 | Private room | 40 | 4 | 0 | NaT | NaN | 2 | 36 | 36.56 | Brooklyn | New York |
| 220028 | 36485431 | Sunny Studio at Historical Neighborhood | 23492952 | Ilgar & Aysel | Harlem | 40.81475 | -73.94867 | Entire home/apt | 115 | 10 | 0 | NaT | NaN | 1 | 27 | 105.11 | Manhattan | New York |
| 220029 | 36485609 | 43rd St. Time Square-cozy single bed | 30985759 | Taz | Hell's Kitchen | 40.75751 | -73.99112 | Shared room | 55 | 1 | 0 | NaT | NaN | 6 | 2 | 50.27 | Manhattan | New York |
| 220030 | 36487245 | Trendy duplex in the very heart of Hell's Kitchen | 68119814 | Christophe | Hell's Kitchen | 40.76404 | -73.98933 | Private room | 90 | 7 | 0 | NaT | NaN | 1 | 23 | 82.26 | Manhattan | New York |